EVISTA – Interactive Visual Clustering System

نویسندگان

  • K. Thangavel
  • P. Alagambigai
چکیده

Due to the enormous increase in the data, exploring and analyzing them is increasingly important but difficult to achieve. Information visualization and visual data mining can help to deal with this. Visual data exploration has a high potential and many applications such as fraud detection and data mining will use information visualization technology for an improved data analysis. The advantage of visual data exploration is that the user is directly involved in the data mining process. There are a large number of information visualization techniques which have been developed over the last decade to support the exploration of large data sets. VISTA is an interactive visual cluster rendering system which invites human into the clustering process, but there are some limitations in identifying the cluster distribution and human-computer interaction. In this paper, we propose an Enhanced VISTA (EVISTA) which addresses these drawbacks. EVISTA improves the visualization in two ways: first it uses the weighted vector normalization instead of max-min normalization, which improves the data visualization such that the user can understand the underlying pattern without human intervention. Secondly it completely eliminates the use of α tuning, which reduces the complexity in visual distance computation and eases the human computer interaction in a better way. The experiment results show that EVISTA explore the underlying pattern of the dataset effectively and reduces the user operation burden greatly.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

iVisClustering: An Interactive Visual Document Clustering via Topic Modeling

Clustering plays an important role in many large-scale data analyses providing users with an overall understanding of their data. Nonetheless, clustering is not an easy task due to noisy features and outliers existing in the data, and thus the clustering results obtained from automatic algorithms often do not make clear sense. To remedy this problem, automatic clustering should be complemented ...

متن کامل

Comparison Between Unsupervised and Supervise Fuzzy Clustering Method in Interactive Mode to Obtain the Best Result for Extract Subtle Patterns from Seismic Facies Maps

Pattern recognition on seismic data is a useful technique for generating seismic facies maps that capture changes in the geological depositional setting. Seismic facies analysis can be performed using the supervised and unsupervised pattern recognition methods. Each of these methods has its own advantages and disadvantages. In this paper, we compared and evaluated the capability of two unsuperv...

متن کامل

The Graph Clustering Problem has a

The input to the Graph Clustering Problem consists of a sequence of integers m 1 ; :::; m t and a sequence of P t i=1 m i graphs. The question is whether the equivalence classes, under the graph isomorphism relation, of the input graphs have sizes which match the input sequence of integers. In this note we show that this problem has a (perfect) zero-knowledge interactive proof system.

متن کامل

An interactive visual testbed system for dimension reduction and clustering of large-scale high-dimensional data

Many of the modern data sets such as text and image data can be represented in high-dimensional vector spaces and have benefited from computational methods that utilize advanced computational methods. Visual analytics approaches have contributed greatly to data understanding and analysis due to their capability of leveraging humans’ ability for quick visual perception. However, visual analytics...

متن کامل

Visualization Support to Interactive Cluster Analysis

We demonstrate interactive visual embedding of partition-based clustering of multidimensional data using methods from the open-source machine learning library Weka. According to the visual analytics paradigm, knowledge is gradually built and refined by a human analyst through iterative application of clustering with different parameter settings and to different data subsets. To show clustering ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009